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O , Abstract 

^ In 2002 Jurdzinski and Lorys settled a long-standing conjecture that palindromes are not a 

^ . Church-Rosser language. Their proof required a sophisticated theory about computation graphs 

G^ I of 2-stack automata. We present their proof in terms of 1-tape Turing machines. 
^ ■ We also provide an alternative proof of Buntrock and Otto's result that the set of bitstrings 

t;:;^- ■ {x : (Vy)x 7^ y^}, which is context-free, is not Church-Rosser. 

d 
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^ '■ 1 Introduction 



In the 1970s, Nivat lfT3l began the study of languages defined by Thue systems: see also [I5I1B . Book 
^ ■ [[2I continued the study of Church-Rosser Thue systems, and the theory has been much extended 
since then |[3l9]|. 

We follow the definitions of length-reducing Thue systems, etcetera, in [|3l. A Thue system S is 
Church-Rosser if whenever 

there exists a string w such that u-^w and v^w. Equivalently, every congruence class contains 
exactly one irreducible string. The redexes, reducts, and irreducible strings, with respect to S, are 
denoted Redexes(5'), Reducts(5'), and Irred(S'). 

Pal denotes the set of (bitstring) palindromes: those bitstrings which read the same backwards 
as forwards, namely. 

Pal = {x e {0, 1}* : = x} 

where x^ is the reversal of x. 

Church-Rosser languages will be described below. They are a surprisingly powerful generali- 
sation of congruential languages, which are finite unions of congruence classes of a finite Church- 
Rosser Thue system. 
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In [[T]| it is shown that Pal is not a congruential language. This is proved by contradiction. 
Otherwise, by definition, Pal is a finite union of congruence classes of a Thue system tQ 
However, the linguistic congruence =pal is the identity relation. It is defined by 

X =pal y <^=^ (def.) (Wu,v){uxv G Pal <^=^ uyv G Pal). 

If X and y are different bitstrings, suppose without loss of generality that |x| < \y\ and y ends in 1. 
Then 

XxO^V 

is not a palindrome but XyO^^^y^ is. Thus x =pal y <^==^ x = y. But would be a refinement of 
=pal, so Ay would be the identity relation, and Pal, being infinite, would not be a finite union of 
congruence classes modulo T. 

(1.1) Definition A language L is Church-Rosser [fTOl if there exists a Church-Rosser Thue system S 
and strings ti, t2, and t^, such that 

{h}-L-{t2} = [h]s. 

We assume without loss of generality that is irreducible, so x E L if and only iftixt2^s't3 
We only consider languages L C {0, 1}*. The alphabet of S may include {0, 1} properly. 

Church-Rosser languages were introduced in 1984 by Narendran ifTTI . and studied in ifTOl by 
McNaughton, Narendran, and Otto. 

Book O had shown that if 5 is a Church-Rosser Thue system then reduction (modulo S) could 
be executed in linear time on a 2-stack automaton. Therefore Church-Rosser languages can be recog- 
nised on a "shrinking" deterministic 2-stack automaton. Two papers by Buntrock and Otto flU and 
Niemann and Otto [[T2l together showed that such automata characterise the Church-Rosser lan- 
guages. 

An early conjecture by McNaughton, Narendran, and Otto IfTOll was that the language of bitstring 
palindromes is not Church-Rosser. This conjecture remained open until it was proved by Jurdzihski 
andLorys in 2002 0. 

Jurdzihski and Lorys' proof (see jSl) is difficult, requiring a complex theory of computation 
graphs for two-stack automata. In this note we propose a simplified proof based on 1-tape Turing 
machines. 

2 1-tape reduction machine 

Given a Church-Rosser Thue system S, we exhibit a 1-tape Turing machine TM implementing re- 
duction modulo S in a systematic way. While Book's 2-stack machine is more efficient BI6II . the 
advantage of studying reductions on a 1-tape Turing machine is that blanks are steadily accumulated, 
allowing us to see where information has been lost. 

'Turing machine' will mean a deterministic machine with quintuple instructions and 2-way infi- 
nite tape, although the worktape used will be only slightly longer than the input string. An instruction 
(quintuple) has the form 

'That T is Church-Rosser does not affect the argument. 



current state, current symbol, new symbol, head movement, new state 
where the head movement is 1 square left or right (the read/write head moves at every step). 
Given a language L such that 

ti - L ■t2 — [tsls, 

on input x the machine TM converts the tape contents to tixt2, reduces it, and compares the result to 
ts. Let E be the smallest alphabet such that 

S* contains Redexes(S'), Reducts(S'), {ti, t2, h}, and L. 

The machine TM executes reductions systematically. If a string z is reducible, then it has a 
leftmost redex, i.e., it can be written as wut where m is a redex and no proper prefix of wu is reducible. 

The set of such strings wu is regular and one can easily describe a DFA D which recognises this 
set, and has the property that when it accepts wu, one such redex u, and hence a rule u ^ v, is 
determined uniquely by its accepting state. Ties are broken arbitrarily. 

Let K be the set of states of D. 

The worktape alphabet of TM consists of 

• E, a new blank symbol B, and left and right sentinel characters and $. 

• Compound symbols [a, k] where a G S U {B} and k e K (the states of D). 

The blank symbols are 

B and {[B,k] : k e K}. 

(2.1) Write h for the following homomorphism. 

{z if z e s, 
a if z = [a, k], a e T,, k e K , 
X otherwise 

Let ko be the initial state of D and 6 the transition function for D. We extend S to K x {HiJ {B}): 

6{k, B) = k, ke K. 
A string [ai, ki] [a2, /C2] . . . [a^, kn\ of compound symbols is historical if for all j < n, 

kj — 5*{ko, 0x02 . . . %). 

Obviously, 

kj+i = 6{kj, aj+i), < j < n - 1. 

(2.2) Definition The string (/:tixt2$ (including endmarkers) is called the initial redex on input x. 
The machine TM creates the intial redex, then reduces as often as needed. 

• Its configurations are represented in the form aqP where aP are the tape contents, including 
on the left and $ on the right, /5 7^ A (so $ is the rightmost symbol in P), q is the current state, 
and the machine is scanning the first symbol of 



• Except for the sentinel characters, all symbols in j3 are in E U {i?} and all symbols in a are 
compound symbols, and a is historical. 

• After 0ti and ^2$ have been added to the input, is always irreducible and h{aP) is always 
a reduct of ^1X^2 (except temporarily during REDUCE phases). 

• First, TM moves to the right, appending to x. Then it moves to the left, prefixing 0ii to 
xt2$: the tape contents are now the initial redex (jttixt2$, and the current symbol is 0. It enters 
a SHIFT phase. 

For the rest of this description aqP denotes the current configuration, and a is the current 
symbol. 

• In a SHIFT phase, if P — then TM enters its final phase, described below. 

Let k' = ko if a = \ or a — otherwise let k' be the state of D occurring in the rightmost 
symbol [a', k'] in a. TM can remember k'. 

If a = then TM moves right. 

Ifa — B then TM overwrites the current square with [B, k'] and moves right. 

Otherwise a e S: let /c = 5{k',a). If /c is not an accepting state of D then TM overwrites the 
current square with [a, k] and moves right. 

Otherwise, k is an accepting state of D, and the string h{a)a ends in a redex u, so there exists 
a rule u ^ v associated with k. TM enters a REDUCE phase. 

• In a REDUCE phase, h{a)a ends with a redex u, and TM can select a unique rule u ^ v 
to be applied. TM moves left, overwriting the rightmost \v\ symbols of aa with v, extending 
leftwards with blank symbols B, until the square holding the leftmost symbol £ of u (or rather, 
a compound symbol [£, k]) is overwritten, moves one square further left, scanning either [i\ k'] 
or (in which case let k' — ko), moves right, writes [B, k'], and enters a SHIFT phase. 

• In the final phase, P = $ and the tape contents are 0a$, and h{a) is irreducible. TM scans 
leftwards to determine whether or not h{a) = t^, and halts. 

• Let L be the maximum length of all redexes. In a REDUCE phase at most L + 1 nonblank 
symbols are scanned, and the number of blank symbols increases by at least 1. 

• There is one left-sweep at the beginning when TM writes 0ii. Thereafter every left-sweep is a 
reduction and increases the number of blank symbols. 

(2.3) Blank symbols do not affect the outcome. It is very important that the blank symbol B 
carries no information, and once a square becomes blank it remains blank (compound symbols [B, k] 

are also considered blank). If one were to insert extra blank squares at any time, provided that B is 
inserted right of the current square and the appropriate symbols [B, k] are inserted left of the current 
square, the same reductions would be performed. 
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3 Kolmogorov complexity 



We use the following definition of the Kolmogorov complexity K{w) of a bitstring w. 

Let the entire family of 1-tape Turing machines (transducers, converting bitstring inputs to bit- 
string outputs) be encoded as bitstrings and a Universal Turing machine UTM be given. The encoding 
of Turing machines should have the property that if y encodes a Turing machine then no proper prefix 
of y does. In that case, for any bitstring x there exists at most one possible factorisation yz of x such 
that y encodes a Turing machine, call it Ty. 

On input x, UTM tests whether x has a prefix y encoding a Turing machine. If not, it loops. 
Otherwise it simulates Ty on input z where x = yz, either looping or computing Ty{z). 

For any bitstring w, there exists a shortest string yz such that Ty computes w on input z. 

The Kolmogorov complexity K{w) is the length of this shortest string. 

Given bitstrings w, y, z such that w — Ty{z) we say that yz encodes w, or, by abuse of language, 
say that z encodes w, and call z the code and y the decoder. 

If K{w) > \w\ then we call w hard. The lemma below is a fundamental result but very easily 
proved. 

(3.1) Lemma For any m e N, there exists a hard string w of length m. 

Proof. There are 2™ — 1 strings of length < m, so there are at most 2™ — 1 (decoder,code) pairs 
yz such that \yz\ < m. Hence there exists at least one string w of length m not encoded by any of 
them. Q.E.D. 

4 Crossing sequences and information loss on a 1-tape reduction 
machine 

On input x, the reduction machine TM first creates the initial redex 

0^1X^2$- 

Suppose that the initial redex has length n and that the tape squares are labelled 1 to n, beginning 
with the 0. The square initially scanned has index |0ti| + 1. 

In discussing crossing sequences, it helps to consider the 'points separating' adjacent squares. 
There are crossing points between squares i and i + 1 for < i < n. During its computation, TM 
occasionally moves from square i to i + 1, or vice- versa; it is said to cross the i-th crossing point. 
This is possible only ifl<i<n — 1. 

(4.1) Definition Given a factorisation ^tixt2% = uv of the initial redex, the u,v-crossing point is 
the crossing point indexed \u\. Or given a factorisation x = uv of the input string, the u, v -crossing 
point is the crossing point indexed \ ftiu\. 

During a computation of TM, forl<i<n— la crossing sequence develops at the i-th crossing 
point, as follows. 

If i > |0ti I then the first crossing is from left to right, when TM attaches ^2$ to the input string, 
and the second is from right to left before TM attaches 0ii to the input string. If 1 < i < |0ii| then 



the first crossing is from right to left. The next square scanned is the i + 1-st if crossing from left to 
right, otherwise it is the i-th. 

Letpi be the state immediately after the first crossing: the next square scanned is scanned in state 
Pi. After that, the crossing point is crossed in the opposite direction, or possibly never. Let p2 be 
the state immediately after the second crossing, if any. Then let p3 be the state immediately after the 
third crossing, and so on. 

The initial direction of movement across the crossing-point is leftwards (resp., rightwards) if the 
crossing point is left (resp., right) of the initial square. Accordingly crossing sequences begin with a 
single bit s indicating whether the point is left (0) or right (1) of the initial square. 

The sequence 

S,Pi,P2,...,Pk 

is called the crossing sequence at the i-th crossing point, where s is if the i-th crossing point is left 
of the initial square, otherwise 1. 

The bit s is called the leading bit in the crossing sequence. 

The number k is the height of the crossing sequence. It ignores s: a crossing sequence of height 
is a single bit. 

Because of the repeated introduction of blanks, we can establish a notion of when significant 
information has been lost. We call a string y depleted when the number of nonblank symbols falls 
below a certain threshold. (The threshold 1/7 will be good enough.) 

(4.2) Definition Suppose that the alphabet of the Thue system realised by TM contains A symbols. 
Suppose a is fixed, < a < 1. Let 

Let ji < j2 be two crossing points. The tape contents between ji and j2 are depleted (at time t) if the 
string y' between these crossing points satisfies 

\h{y')\<P{j2-ji). 

If y is a distinguished substring of an input string then we say that y is depleted at time t if the 
initial redex (^tixt2$ = uyv and the tape contents become depleted as described, where ji — \u\ and 
J2 = \uy\. In this case, 

\hiy')\ < m- 

The constant (3 is introduced because it is the bit-length of h{y') which matters, that is, the length 
of a bit-string encoding h{y'). The depletion lemma guarantees that h{y') has bit-length < a\y\. 

(4.3) Lemma (Depletion Lemma). There exist constants H and d such that during any computation 
of TM, if two crossing points are at least d squares apart and the height of all crossing sequences at 
and between them is at least H, then the string between these points is depleted. 

Proof. Let L be the maximum length of all redexes. Suppose that at crossing point j, and at 
time t, the crossing sequence has height H or greater. This includes [i?/2j right-to-left movements. 
The first may be when the string 0ti is attached to the input, and another may be the last move in 
a reduce phase, when TM scans the j-th square, which contains [a, k'], say, to ascertain the state k' 



of D. However, at that time the j + 1-st square goes from nonblank to blank, so it happens at most 
once. Apart from these two exceptions, every right-to-left movement across the j-th crossing point 
is during a REDUCE phase and produces more blanks to the left of that point. This happens at least 
[H/2 — 2 J times up to time t. 

Consider a section of at most K — [H/2 — 2j + L — 1 squares ending at the j-th square. So 
long as the section includes L or more nonblank squares, all of these REDUCE phases increase the 
number of blanks in the section. By time t the section contains at most L — 1 nonblank squares. 

Now suppose that the stated threshold holds at all crossing points from the {j — i)-th to the j-th 
inclusive, where i > d. Subdivide the tape between these points into sections of length K plus one 
leftmost section of length between and K — 1. This subdivision produces l^/K] sections. By time 
t, the number of nonblank squares between these crossing points is at most 

(L-m+i) 

K 

K depends directly on H. Choose H large enough so that 

'-^<^- ->^- 

Choose 



^= \yK — 7l- 



L-l 

Then for all £ > d, 

(L-l)(£+l 



1 

- 1 



K 

as required. Q.E.D. 
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5 Cut-and-paste methods 

In a 'cut and paste' method, given an input string x, one replaces a substring v of x with another 
string v', so x = uvw is changed to a string x' = uv'w. Given that the crossing sequences around v 
and v' are compatible, the computations on x and x' should be similar. 

We consider partial computations of M, where M is a 1-tape Turing machine. By 'partial' is 
meant that they begin at initial configurations but do not necessarily end in halting computations. 
Associated with every partial computation is the list of crossing sequences generated by the compu- 
tation. 

Recall that a crossing sequence is a sequence of the form 

s,Pi, ■■■,Pk 

where s is a single bit and pi, . . . are states of M. The leading bit is always given, but if /c = 
then the sequence is considered empty. 

In this section we assume that the squares are indexed so the first square scanned has index 1. 



(5.1) Given an input string x, there is a unique computation (possibly infinite) on input x. Suppose 
the initial tape contents are presented as ax ■■ - cin, where K <1 and > |a;| and ,t = ai . . . a|a;|; the 
other Oj are blank. Assume that in any partial computation under consideration, only squares indexed 
between K and N are scanned, perhaps not all of them. 

(5.2) Now suppose that we are given an alternating list of crossing sequences Cj and symbols Oj, 

ck-i,clk,ck, ■ ■ ■ ,cin,cn, (5.1) 

where the leading bit in Cj is if i < 1 and 1 if i > 1, and the input string x is ai . . . a^^^^. 

Also, those i such that Cj is nonempty form a contiguous (possibly empty) interval, and ck and 
Cat are empty, with leading bits and 1 respectively. 

(5.3) Full verification. Given this data, it is easy to trace the computation on input x and produce a 
sequence of sextuples 

V-l Pr-1 Cir-1 dr j^r Pr, r = 1, 2, . . . 

giving the square scanned and the quintuple applied at the first, second, . . . steps. At the same time 
the procedure can check the state Pr against the relevant crossing sequence (po = Qo is not checked). 

This can be done by maintaining the index of the current square, the current state, and arrays 
Ai, K < i < N and Ii,K — 1 < i < N. The array Ai gives the current tape contents, and Ij gives 
the number of states currently cancelled from q. The procedure is simple and we omit the details. 

The procedure should continue until either 

• it reaches a halting configuration, 

• it attempts to check p,. against a state in some Cj where Jj has reached the height of q, meaning 
that all states in q have been 'cancelled,' or 

• it checks pr against a state in some q and discovers a mismatch. 

In the first two cases, if all states in all the q have been cancelled, it reports 'consistent,' else it 
reports 'inconsistent.' In the third case, it reports 'inconsistent.' 

(5.4) Local verification. Next let us fix some k, K < k < N, and consider how this procedure 
affects the A;-th square: the relevant data and variables are 

k, Cfc-i, h-i, Q, Ak, Ck, Ik- 

Let us suppose, omitting some simple variants, that A; > 2, so the square is first entered from the left. 
When the square is first entered, q has just been cancelled from Ck-i and Ak = a^, and a quintuple 
qaka'jj,q' applies, say. Ak := a', q := q', and the next square scanned is A; ± 1 depending on /x: q' is 
cancelled from Ck or Ck-i as appropriate, and the next time the square is entered, q is taken from Ck 
or Ck-i. The procedure continues until there is a mismatch or it attempts to check q' against Ck-i or 
Ck when all of it has already been cancelled. At this point, if there is a mismatch, or not both these 
crossing sequences have been fully cancelled, it reports 'inconsistent,' else it reports 'consistent.' Let 
us call this procedure a local verification at the k-th square. 
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(5.5) Definition Given the data d5.il) . i.e., c^-i, clk, Ck, ■ ■ ■ , cln, cn, a consecutive triple is a triple 

Cfc-i, ctfc, Ck where K < k < N . 
The consecutive triple 

is compatible if the local verification at the k-th square reports 'consistent.' 

(5.6) Theorem The data l\5.1\) is consistent with a partial computation on input x if and only if for 
each k between K and N the consecutive triple 

Cfc-l, Clfc, Ck 

is compatible. In this case the local verification at k also computes the contents of the k-th square at 
the end of the partial computation, and identifies the unique square at which the partial computation 
ends. 

Proof. If the data in (15.11) is consistent with a partial computation on input x, the local verification 
at every square will have the same effect as the full verification and report 'consistent,' so Cfc_i, Ofc, Ck 
are compatible and the final value of Ak will be the same as in the full verification. 

Granted that the data is consistent, the unique A;-th square at which the partial computation ends 
is easily determined from k, Ck-i, ak, Ck by checking the final head-movement across the k — 1st and 
kth crossing points. 

Otherwise, the full verification would report inconsistency. Suppose it terminates at the A;-th 
square. Up to this point, its actions at the A;-th square are the same as the local verification procedure 
on that square, so the local verification at k will terminate and report inconsistency for the same 
reason, and Ck-i, ak, Ck are incompatible. Q.E.D. 

The Jurdzihski-Lorys proof uses a kind of pumping lemma and a kind of splicing lemma. The 
pumping lemma is 

(5.7) Corollary (Pumping Lemma). Suppose that x is an input string and x = uvw where v ^ X 
and in some partial computation on input x, the u, vw-crossing sequence equals the uv, w-crossing 
sequence. Explicitly, suppose the data 

CK-i,aK,Ck, ■ ■ ■ ,aN,CN, (5.2) 

describes a partial computation on input x. Write and V = Cj+i . . . aj. Let x' = uw = 

ai . . . aiaj+i . . . an. 

Then i < j, Ci and Cj are the u, vw- and uv, w-crossing sequences respectively, and 

ck-i, ax, Ck, ■ ■ ■ , Qj-i, Q, aj, Cj+i, . . . , aN, cn (5.3) 

is produced by a partial computation on input x'. 

Furthermore, ifa'j^... a'^ are the tape contents at the end of the first partial computation, then 

I II I 
. . . a^a^j^^ ■ ■ - a^ 

are the contents at the end of the second. 



Proof. From Theorem 15 ■6[ all triples c^^i, a^, from the list in Equation (15.21) are compatible. 
Since Cj = Cj, the same goes for the list in Equation (15. 3L so they are produced by a partial compu- 
tation on input x'. The remark about the final tape contents also holds because they can be calculated 
by the local verification. Q.E.D. 

The other cut-and-paste result is restricted to our reduction machine TM. Recall (Paragraph 12.11) 
that h is a homomorphism which erases blank symbols, and a blank symbol may differ from the 
specific blank B. 

(5.8) Definition Let TM be a reduction machine with initial redex ^tixt2% = uvw and suppose that 
a computation is executed up to a time T. Let Ci be the u, vw -crossing sequence at that point, and C2 
the uv, w-crossing sequence, and suppose that z is the tape contents between these crossing points at 
time T (i.e., z is the string occupying squares \u\ + 1 to \uv\ at time T). 

If at time T, the square being scanned is one of these squares, write v = a[3 where this square is 
the first in jd and let i = \h{a) \ + 1; otherwise let i = 0. 
Let q be the state at time T. 
Then the data 

\v\,Ci,h{z),C2,i,q 
is called a residue or {u, v, w)-residue (at time T). 

The residue is associated with a distinguished substring v of the initial redex. It includes |f |, q, 
and i, to simplify the 'splicing lemma' (15.101) below. 

(5.9) Lemma Suppose xi and X2 are input strings, and there exist times Ti and T2 such that the 
A, ^tiXit2$, X-residue at time Ti and the the A, ^tiX2t2%-, X-residue at time T2 are the same. Then xi 
and X2 possess the same irreducible reduct, so TM accepts Xi iff it accepts X2. 

Proof. The respective initial redexes lead to configurations at times Ti and T2 which are the 
same except for occurrences of blank symbols, which don't affect the outcome of the computations 
(Paragraphias]). Q.E.D. 

(5.10) Lemma (splicing lemma). Let TM be a reduction machine. Given two computations, with 
inputs factorised as uvw andu'v'w', suppose that at some time t in the first computation, and another 
time t' in the second, the residue ofv in the first coincides with the residue ofv' in the second. Then 
uvw and uv'w possess the same irreducible reduct, so TM either accepts or rejects both strings. 

Proof. Suppose the common residue is \v\,Ci,h{z),C2,i,q. Associated with the first computation 
suppose we have the data 

CK-i,aK,---,aN,CN, (5.4) 
X = ai . . . ttn, and v = Ui . . . aj. Similarly, for the second, we have the data 

(5.5) 

x' = hi . . .bn', and v' = bii . . . bji. We are given that q = c^, and Cj = c'^,. By Theorem 15.61 each 
consecutive triple in both lists of data is compatible. Corresponding to the input x' = uv'w we have 
the list 

(5.6) 



and each consecutive triple in this list is compatible. Therefore by Theorem 15.61 there is a partial 
computation on input x' which produces the crossing sequences (15.61) . 

The residues include the lengths of v and f so v and v' have the same length. 

The tape squares where these partial computations end are determined by the local verifications 
(Theorem 15. 61) . If £ = then the first computation ends outside the range of v, so the third computa- 
tion ends outside the range of v' , at the same square according to the local verifications. Therefore 

at the end of the third computation, the (A, ^t\uv'wt2%., A)-residue is the 
same as the (A, 0tiMf wt2$, A)-residue at the end of the first computation. 

From Lemma [S!9l uvw and uv'w have the same irreducible reduct. 

Let z and z' be the string in the squares originally occupied by v and v' in the first two computa- 
tions. 

If £ > then the first and second computations end at positions k and k\ say, within the ranges 
of V and v' respectively. Factorise z as (x(5 where |a| = and z as where |a;'| = k' . Then from 
the residue, hip) = h{a') and h{(3) = h{(3'). Again we reach the conclusion (*), so uvw and uv'w 
have the same irreducible reduct. Q.E.D. 



6 Jurdzinski and Lorys' proof 

Given a 1-tape reduction machine accepting all bitstring palindromes, in particular it accepts all 
palindromes of the form 

{ww^'f'^^ 

where ww^ is hard. Jurdzinski and Lorys II7I8II showed that no deterministic 2-stack automaton can 
recognise this set, and their arguments can be applied unchanged to the 1-tape reduction machine 
TM. 

The string 

0tl WW^ . . . WW^ ^2$ 

can be viewed as 2i + 3 blocks indexed from to 2i + 2. The middle block is indexed i + 1. Block 
is 0ti and block 2z + 2 is t2$, and for 1 < j < 2z + 1, the j-th block is the j-th occurrence of 
ww^. Blocks and 2i + 2 are the outer blocks, and the others are inner blocks. We suppose that the 
machine TM recognises the set of palindromes and derive a contradiction. 

The crucial lemma is the Middle Block Lemma, [6^ below. A parameter H will be fixed accord- 
ing to the Depletion Lemma above; in fact the depletion threshold a = 1/7 will be good enough. 
We first establish a pumping result, because it effects the choice of constants in the Middle Block 
Lemma. 

Define Q as the smallest integer such that TM has fewer than 2*^ states. All states can be repre- 
sented as Q-bit patterns, and there is an extra pattern to represent 'no state,' used for padding. Then 
every crossing sequence of height < H can be encoded as a string of QH + 1 bits. 

(6.1) Lemma (pumping effect). Let H be fixed and Q defined as above, and let w be a hard string 
of length m. Given input x = {ww^y^'^^ where i > 8m x 2*^^+^, suppose at a certain time t in 
the computation, within each inner block to the left of the middle block there is at least one crossing 
sequence of height < H. 
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Then x can be factorised as uiU2U^, so that the shorter string x' = uiUs is also of the form 
{ww^Y"^ and has the property described in Corollary 15. 71 i.e., at some time t' in the computa- 
tion on input x', the crossing sequences are the same as corresponding crossing sequences in the 
computation on x at time t, and the tape contents agree outside the region originally occupied by U2. 

Sketch proof. For each j, 1 < j < choose a crossing point kj in the j-th block where the 
crossing sequence has height < H. A crossing point belongs to a block if it is between the crossing 
points bounding the block, or coincides with one of them. Perhaps some crossing points are counted 
twice, but no crossing point is counted more than twice, and therefore there are more than 4m x 2^^^+^ 
crossing points chosen. The residues kj mod 4m fall into 4m classes and therefore there exists an r, 
< r < 4m, such that the set 

{j : kj = r mod 4m} 

contains more than 2*5^+^ indices. This gives more than 2^^^^ crossing points where the crossing 
sequences at time t have height < H. There are at most 2*^^+^ such sequences, so the same sequence 
must occur at two crossing points, call them ki and k2, where 4m divides k2 — ki. These crossing 
points are in the region of tape originally occupied by the input string. 

Factorise x as U1U2U3 where |0tiMi| = ki and \u2\ = k2 — ki. Since the Mi,M2'U3- and uiU2,U3- 
crossing sequences match, this factorisation has the properties described in Corollary 15. 7[ Because x 
is an odd power of ww^ and \u2\ is a multiple of 4m = 2\ww^\, uiu^, is also an odd power of ww^, 
so ^1^3 = as asserted. Q.E.D. 

(6.2) Note. The above lemma will be combined with Lemma 16.41 to derive a contradiction. Ac- 
cording to the Middle Block Lemma, if m is sufficiently large then the middle block is the first to be 
depleted on input x = (ww^Y^^^. That is, the middle block has reached depletion level and no other 
block has. Consider the string x' = (ww^Y^ It was formed as follows: in the original computa- 
tion, the tape was divided into regions A, B,C, and the region B was deleted. Also, the middle block 
is entirely in the region C. In the second computation, the tape has regions A' and C corresponding 
to A and C. The block corresponding to the middle block is entirely in the region C All blocks in 
A' and C correspond to blocks in A and C. There may be one other block in x' to consider, namely, 
a block straddling A' and C, which does not correspond to a block in the first computation. This will 
be considered again in the proof of the main result. 

(6.3) Prefix encoding of numbers. We need to encode numbers such as i as bitstrings so that no 
encoding is a proper prefix of another encoding. This is easily done. Given a positive integer r, first 
represent it as a binary number s with leading bit 1. Let q be the homomorphism t-^ 00, 1 1-^ OL 
Then r is represented as 

g(s)n. 

Also, can be represented as 11. This encoding has the prefix property, and uses fewer than 4 + 
21og2(r + 1) bits. 

In applying the Depletion Lemma, we take the depletion level a to be 1/7. Recall that P = 
a/ [log2 A] where A is the size of the Thue system alphabet. 

(6.4) Lemma (Middle block lemma.) Given input x = {ww where z < 9m x 2^^+\| let the 
computation continue until some inner block is depleted, but only one, at time t, say. Then ifm = \w\ 
is large enough, and the depletion level is 1/7, the block must be the middle block. 

•^With this bound on i, Lemma |6T| can be used later. 



Proof. Suppose the j-th block is the first inner block to become depleted, and j i + 1. For 
clarity we suppose j < i + 1. We consider the three blocks j — 1, j, j + 1 together. 

The case j = I should be treated separately. Assume j > 1 so we have three consecutive inner 
blocks. By the depletion lemma (14.31) . there exists a crossing point in the (j — l)st block where the 
crossing sequence has height < H. Choose the rightmost such crossing point within the (j — l)st 
block, and let its index be ji. Here |0ti| + (j — 2) x 2m < ji < |0ti| + (j — 1) x (2m). Similarly 
let j2 index the leftmost crossing sequence, in the (j + l)st block, whose height is < H. 

Consider the following data. 

m,i,jJi,Cij2,C2, h{y'),i,q (6.1) 

where y' is the string between crossing points indexed ji and j2 at time t, and ci and C2 are the 
crossing sequences at that time at those points. Also, £ indicates the relative position of the and q is 
the state reached, as given in a residue (Definition 15. 8 1) . 

Note that j2 — ji < 6m, so by the depletion lemma (14.31) . h(y') can be encoded as a bitstring of 
length < 6m/7. The crossing sequences can be encoded as bitstrings of length QH + 1, and numbers 
m, i, etcetera, are 0{m). The numbers can be encoded as discussed in Paragraph [63] above, allowing 
all the data to be encoded uniquely in a bitstring z of length 

\z\ < — — h O(logm). 

It is straightforward to consider in turn every string w' of length m, and determine whether, on 
input {w'w'^y^^^, the j-th block is the first to become depleted, and if so, whether the residue matches 
the given information. If there is only one such string w', then w' = w, so we have a way to generate 
w from the given information. Suppose that Ty is a Turing machine constructing w from z. Then yz 
encodes w. If m is sufficiently large, then \yz\ < m, contradicting the fact that w is hard. 

Therefore there exists another string w' of length m which is consistent with the information 
stored in z. Then there exist factorisations 

<^ti{wW^f'^H2$ = tvu 

and 

^h{w'w"'f'+H2% = t'v'u' 

where \t\ = \t'\, \v\ = \v'\, and |m| = \u'\, and at corresponding points in the computations the t, u, v- 
residue matches the t', m', f '-residue. Then TM accepts tv'u (Lemma [5. 101 ). However, in the string 
tv'u, the j-th block w'w'^ in v' has its mirror image in u, which is of the form ww^, so tv'u is not a 
palindrome, a contradiction. 

The analysis is much the same if the depleted block is indexed 1, since block indexed zero is the 
same for all input strings. We conclude that the middle block is the first to be depleted. Q.E.D. 

(6.5) Theorem (Jurdzinski-Lorys.) Pal is not a Church-Rosser language. 

Proof. Otherwise there is a reduction machine TM as described, and an input string 

X = {ww^f'+\ i = 9m X 2<3^+\ 
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By Lemma (631 at some time during the computation on input x, the middle block becomes depleted, 
but no other block is depleted. By Lemma [6?T1 the tape can be divided into regions A, B, C, where C 
contains the middle block, and there exists a shorter string x' = [ww^Y''- +^ obtained by deleting B. 
Correspondingly, the tape with input x' is divided into regions A' and C . According to the lemma, 
there exists a time t' in the computation on input x' where the crossing sequences and tape contents 
in A and C at time t correspond exactly to those in A' and C at time t' . In the original computation 
(at time t), only the middle block is depleted. Since all blocks in the second computation at time 
t' , except perhaps one block straddling A' and C , are the same as in the first at time t, in the latter 
computation at least one block is depleted and at most two. One corresponds to the original middle 
block and is in region C", to the left of centre. The other straddles A' and C and is also to the left 
of centre. One of these blocks is the first to be depleted in the second computation, contradicting 
Lemma [63] for x' . Q.E.D. 

7 Application to non-squares 

It is relatively easy to prove a result of Buntrock and Otto's flU that the set 

L = {xG{0,ir : {^y)x^y^} 

is not a CRL. Here is a sketch proof. 
Consider bitstrings of the form 

where w is hard. The string 0U7^$ reduces to some string a which causes to be rejected, since 
^ L. Consider the first block (occurrence of w) to be depleted in the computation. Repeating the 
arguments of this paper, whatever is the first block to be depleted, the data 

does not determine w uniquely if w is hard. There is another string w' of the same length which can 
replace one occurrence of w, where by Lemma [STTOl the altered string reduces to a and is rejected, 
whereas it belongs to L. | 
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